Executable Code Actions Elicit Better LLM Agents

This work proposes to use executable Python code to consolidate LLM agents' actions into a unified action space (CodeAct). (Abstract)

テキストやJSONで出力する代わりに実行可能なPythonコード

CodeActAgent, finetuned from Llama2 and Mistral, is integrated with Python interpreter and uniquely tailored to perform sophisticated tasks (e.g., model training) using existing libraries and autonomously self-debug.

https://github.com/xingyaoww/code-act

We implemented a containerized code execution engine based on JupyterKernelGateway.

https://github.com/xingyaoww/code-act/blob/main/scripts/chat/code_execution/start_jupyter_server.sh

https://github.com/xingyaoww/code-act/blob/main/scripts/chat/code_execution/api.py

jupyter-kernel-gateway

JupyterKernelなどを実装している！ https://github.com/xingyaoww/code-act/blob/main/scripts/chat/code_execution/jupyter.py#L55

2. CodeAct Makes LLMs Better Agents（Figure 1）